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Does Not Compty 
Gorrected Diskette Needed 



<110> APPLICANT: SIA, Charles D.Y. 
CaO/ Shi-Xian 
Per s son, Roy 
Rovinski , Benjamin 

<120> TITLE OF INVENTION: EXPRESSING GP140 FRAGMENT OF PRIMARY HIV-1 ISOLATE 
<130> FILE REFERENCE: 1038-1176 MrS:jb 
<140> CURRENT APPLICATION NUMBER: 09/914,205 
<l/ 3^> CURRENT PILING DATE: 2002-04-09 



CURRENT APPLICATION NUMBER; PCT/CAO 0/0 0190 



CURRENT FILING DATE: 2000-02-24 

PRIOR APPLICATION NUMBER: 09/256,194 
PRIOR FILING DATE: 1999-02-24 
NUMBER OF SEQ ID NOS : 17 
SOFTWARE: Patentin Ver. 2.1 
SEQ ID NO: 1 
LENGTH: 1971 
TYPE: DNA 
ORGANISM: Human 
SEQUENCE : 1 



immunodeficiency virus type 1 



atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc 
tcggctagct tgtgggtcac agtctattat ggggtacctg tgtggaaaga 

tgctaaagca tatgatacag aagtacataa 



actctatttt gtgcatcaga 
acacatgcct gtgtacccac agaccccaac 
gaaaatttta acatggggaa aaataacatg 
ttatgggatc aaagcctaaa gccatgtgta 
tgcactaagt tgaagaatag tactgatacc 
aaaaactgct ctttcaacat cagcacaagt 
cttttttata gtcttgatat agtaccaata 
agttgtaata cctcaatcat tacacaggcc 



ccacaagaag tagtattggg 
gtagaacaga tgcatgaaga 
aaattaaccc cactctgtgt 
aataatacta gatggggaac 
gtaagaaata agatgaagag 
gataatgata atactagcta 
tgtccaaagg tatcctttga 



atacattttt gtgccccggc tggttttgcg attctaaagt gtaacaataa 



ggaacaggac catgtacaaa tgtcagcaca 
gtatcaactc aactgctgtt aaatggcagc 
gaaaatttca caaacaatgc taaaaccata 
aattgtacaa gacccaacaa caatacaaga 



gtacaatgta cacatggaat 
ctagcagaag aagaggtagt 
atagtacagc taaatgaatc 
aaaagtatac atataggacc 



ttttatacaa caggagatat aataggagat ataagacaag cacattgtaa 



acaaactgga ctaacacttt aaaaagggta 
acaacaatag tctttaatca atcctcagga 
aattgtggag gggaattttt ctactgtaat 
gaaactaaca gtgaaggaaa tatcaccagt 
caaattataa acatgtggca ggaagtagga 
caaattaaat gtttgtcaaa catcacaggg 
aacagtagta gtgggaaaga gatcttcaga 



gctgaaaaat taagagaaaa 
ggggacccag aaattgtaat 
acaacacaac tgtttaatag 
ggaactataa cactcccatg 
aaagcaatgt atgcccctcc 
ctgttattaa caagagatgg 
cctggagggg gagatatgag 



agaagtgaat tatataaata taaggtagta aaaattgaac cattaggaat 



agtcttcgtt 
agcaaccacc 
tgtttgggcc 
aaatgtgaca 
tataattagt 
tactttaaat 
acaagaaatg 
agaatatgca 
taggttaaga 
gccaattccc 
aacgttcaat 
taggccagta 
aattagatct 
tgtagaaatt 
agggagagca 
cattagtaga 
atttaataat 
gcacagtttt 
tacttggaat 
cagaataaaa 
catcggagga 
tggtagtgat 
ggacaattgg 
agcacccacc 
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55 aaggcaaaga gaagagtggt gcagagagaa 

56 cttgggttct tgggagcagc aggaagcact 

57 caggccagac aattattgtc tggtatagtg 

58 gaggcgcaac agcacctgtt gcaactcaca 

59 gtcctggctc tggaaagata cctacaggat 

60 ggaaaactca tctgcaccac tgctgtgcct 

61 agtcagattt gggataacat gacctggatg 

62 gagataatat atagcttaat tgaagaatcg 

63 ttattacaat tggataagtg ggcaagtttg 



66 <210> SEQ ID 


NO: 


2 






















67 <211> LENGTH 


: 657 






















68 <212> TYPE: 


PRT 
























69 <213> ORGANISM: 


Human iininunodef iciency virus 


type 1 






71 <400> SEQUENCE: 


2 






















72 Met 


Asp 


Ala 


Met 


Lys 


Arcr 


Gly 


Leu 


Cys 


Cys 


Val 


Leu 


Leu 


Leu 


Cys Gly 


73 1 








5 










10 










15 


75 Ala 


Val 


Phe 


Val 


Ser 


Ala 


Ser 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tvr 


Gly Val 


76 






20 










25 










30 




78 Pro 


Val 


Trp 


LVS 


Glu 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp Ala 


79 




35 










40 










45 






81 Lys 


Ala 


Tvr 
J. jf J. 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala Cys 


82 


50 










55 










60 








84 Val 


Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly 


Asn 


Val Thr 


85 65 










70 










75 








80 


87 Glu 


Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


val 


Glu 


Gin 


Met 


His Glu 


88 








85 










90 










95 


90 Asp 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys Leu 


91 






100 










105 










110 




93 Thr 


Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser Thr 


94 




115 










120 










125 






96 Asp 


Thr 


Asn 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys Ser 


97 


130 










135 










140 








99 Phe 


Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr Ala 


100 145 










150 










155 








160 


102 Leu 


Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Asn 


Thr Ser 


103 








165 










170 










175 - 


105 Tyr 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He 


Thr 


Gin 


Ala 


Cys Pro 


106 






180 










185 










190 




108 Lys 


Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala Gly 


109 




195 










200 










205 






111 Phe 


Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly Pro 


112 


210 










215 










220 








114 Cys 


Thr 


Asn 


Val 


Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro Val 


115 225 










230 










235 








240 


117 Val 


Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu Val 


118 








245 










250 










255 


120 Val 


lie 


Arg 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala 


Lys 


Thr 


He 


He Val 


121 






260 










265 










270 




123 Gin 


Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn Asn 



aaaagagcag tgggaatagg agccatgttc 1500 
atgggcgcag cgtcactaac gctgacggta 1560 
cagcagcaaa acaatttgct gagggctatt 1620 
gtctggggca tcaagcagct ccaggcaaga 1680 
caacggttcc tagggatgtg gggttgctct 1740 
tggaatgcta gttggagtaa taaaaatcta 1800 
gagtgggaga gagaaataag caattacaca 1860 
cagaaccaac aagaaaagaa tgaactagac 1920 
tggaattggt ttgacataac a 1971 
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124 



275 



280 



285 



126 Thr Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr 

127 290 295 300 

129 Gly Asp lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser Arg 

130 305 310 315 320 

132 Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu 

133 325 330 335 

135 Lys Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp 

136 340 345 350 

138 Pro Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 

139 355 360 365 

141 Cys Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser 

142 370 375 380 

144 Glu Gly Asn lie Thr Ser Gly Thr lie Thr Leu Pro Cys Arg lie Lys 

145 385 390 395 400 

147 Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro 

148 405 410 415 

150 Pro lie Gly Gly Gin lie Lys Cys Leu Ser Asn lie Thr Gly Leu Leu 

151 420 425 430 

153 Leu Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu lie 

154 435 440 445 

156 Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 

157 450 455 460 

159 Tyr Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly lie Ala Pro Thr 

160 465 470 475 480 

162 Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 

163 485 490 495 

165 Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 

166 500 505 510 

168 Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly 

169 515 520 525 

171 He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin 

172 530 535 540 

174 His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg 

175 545 550 555 560 

177 Val Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met 

178 565 570 575 

180 Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn 

181 580 585 590 

183 Ala Ser Trp Ser Asn Lys Asn Leu Ser Gin He Trp Asp Asn Met Thr 

184 595 600 605 

186 Trp Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Glu He He Tyr 

187 610 615 620 

189 Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp 

190 625 630 635 640 

192 Leu Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp He 

193 645 650 655 
195 Thr 

199 <210> SEQ ID NO: 3 
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200 <211> LENGTH: 10 

201 <212> TYPE: PRT 

202 <213> ORGANISM: Human immunodeficiency virus type 1 

204 <400> SEQUENCE: 3 

205 lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr 

206 1 5 10 

209 <210> SEQ ID NO: 4 

210 <211> LENGTH: 9 

211 <212> TYPE: PRT 

212 <213> ORGANISM: Human immunodeficiency virus type 1 

214 <400> SEQUENCE: 4 

215 Ala Tyr Asp Thr Glu Val His Asn Val 

216 1 5 

219 <210> SEQ ID NO: 5 

220 <211> LENGTH: 9 

221 <212> TYPE: PRT 

222 <213> ORGANISM: Human immunodeficiency virus type 1 

224 <400> SEQUENCE: 5 

225 Phe Tyr Ser Leu Lys lie Val Pro lie 

226 1 5 

229 <210> SEQ ID NO: 6 

230 <211> LENGTH: 9 

231 <212> TYPE: PRT 

232 <213> ORGANISM: Human immunodeficiency virus type 1 

234 <400> SEQUENCE: 6 

235 Leu Tyr Lys Tyr Lys Val Val Lys lie 

236 1 5 

239 <210> SEQ ID NO: 7 

240 <211> LENGTH: 10 

241 <212> TYPE: PRT 

242 <213> ORGANISM: Human immunodeficiency virus type 1 
244 <400> SEQUENCE: 7 

24 5 Lys Tyr Lys Val Val Lys lie Glu Pro Leu 
246 1 5 10 

249 <210> SEQ ID NO: 8 

250 <211> LENGTH: 9 

251 <212> TYPE: PRT 

252 <213> ORGANISM: Human immunodeficiency virus type 1 

254 <400> SEQUENCE: 8 

255 Arg Tyr Leu Gin Asp Gin Arg Phe Leu 

256 1 5 

259 <210> SEQ ID NO: 9 

260 <211> LENGTH: 9 

261 <212> TYPE: PRT 

262 <213> ORGANISM: Human immunodeficiency virus type 1 

264 <400> SEQUENCE: 9 

265 Asn Tyr Thr Glu lie lie Tyr Ser Leu 

266 1 5 
269 <210> SEQ ID NO: 10 
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270 <211> LENGTH: 9 

271 <212> TYPE: PRT 

272 <213> ORGANISM: Human immunodeficiency virus type 1 

274 <400> SEQUENCE: 10 

275 Lys Leu Thr Pro Leu Cys Val Thr Leu 

276 1 5 

279 <210> SEQ ID NO: 11 

280 <211> LENGTH: 9 

281 <212> TYPE: PRT 

282 <213> ORGANISM: Human immunodeficiency virus type 1 

284 <400> SEQUENCE: 11 

285 Thr Leu Phe Arg Val Ala lie Lys Leu 

286 1 5 

289 <210> SEQ ID NO: 12 

290 <211> LENGTH: 9 

291 <212> TYPE: PRT 

292 <213> ORGANISM: Human immunodeficiency virus type 1 

294 <400> SEQUENCE: 12 

295 Thr Leu Thr Val Gin Ala Arg Gin Leu 

296 1 5 

299 <210> SEQ ID NO: 13 

300 <211> LENGTH: 9 

301 <212> TYPE: PRT 

302 <213> ORGANISM: Human immunodeficiency virus type 1 

304 <400> SEQUENCE: 13 

305 Thr Leu Thr Val Gin Ala Arg Ala Leu 

306 1 5 

309 <210> SEQ ID NO: 14 

310 <211> LENGTH: 9 

311 <212> TYPE: PRT 

312 <213> ORGANISM: Human immunodeficiency virus type 1 

314 <400> SEQUENCE: 14 

315 Gin Leu Gin Ala Arg Val Leu Ala Leu 

316 1 5 

319 <210> SEQ ID NO: 15 

320 <211> LENGTH: 1983 " 

321 <212> TYPE: DNA 

322 <213> ORGANISM: Human immunodeficiency virus type 1 

324 <400> SEQUENCE: 15 

325 gatccaccat ggatgcaatg aagagagggc tctgctgtgt gctgctgctg tgtggagcag 60 

326 tcttcgtttc ggctagcttg tgggtcacag tctattatgg ggtacctgtg tggaaagaag 120 

327 caaccaccac tctattttgt gcatcagatg ctaaagcata tgatacagaa gtacataatg 180 

328 tttgggccac acatgcctgt gtacccacag accccaaccc acaagaagta gtattgggaa 240 

329 atgtgacaga aaattttaac atggggaaaa ataacatggt agaacagatg catgaagata 300 

330 taattagttt atgggatcaa agcctaaagc catgtgtaaa attaacccca ctctgtgtta 360 

331 ctttaaattg cactaagttg aagaatagta ctgataccaa taatactaga tggggaacac 420 

332 aagaaatgaa aaactgctct ttcaacatca gcacaagtgt aagaaataag atgaagagag 480 

333 aatatgcact tttttatagt cttgatatag taccaataga taatgataat actagctata 540 

334 ggttaagaag ttgtaatacc tcaatcatta cacaggcctg tccaaaggta tcctttgagc 600 
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